Prosody Modification for Vocoder Based on Amplitude Spectrum of Residual Signal
نویسندگان
چکیده
This paper describes the prosody modification (pitch and duration) for vocoder based on amplitude spectrum of residual signal. In this vocoder, period component is represented as amplitude spectrum of half pitch period length and aperiod component is estimated from the difference of amplitude spectrum between the constructed period signal and the residual signal. Then, pitch modification is conducted by resampling the period spectrum according to desired pitch period length in frequency domain and duration modification is conducted by adjusting the frame shift length in time domain. Listening tests show that the speech quality of proposed vocoder after modification is not decreased so much and can get comparable performance with STRAIGHT.
منابع مشابه
A new F0 modification algorithm by manipulating harmonics of magnitude spectrum
This paper proposes a new speech modification algorithm based on a vocoder framework to synthesize high quality speech. Its innovation is in preserving the fine structure of the magnitude spectrum. A key point is the use of a “compensatory gaussian window” to extract moderate F0 harmonics structures in the magnitude spectrum. The other key point is, starting from the magnitude spectrum, generat...
متن کاملAmplitude Spectrum based Excitation Model for HMM-based Speech Synthesis
This paper describes an excitation model based on amplitude spectrum for hidden Markov model (HMM)-based speech synthesis system (HTS). Residual signal obtained from inverse filtering is decomposed into periodic and aperiodic spectrums in frequency domain. Amplitude spectrum of half pitch period length is reserved as periodic component in synthesis stage and zero-phase criterion and pitch synch...
متن کاملProsody Modification Using Allpass Residual of Speech Signals
In this paper, we attempt to signify the role of phase spectrum of speech signals in acquiring an accurate estimate of excitation source for prosody modification. The phase spectrum is parametrically modeled as the response of an allpass (AP) filter, and the filter coefficients are estimated by considering the linear prediction (LP) residual as the output of the AP filter. The resultant residua...
متن کاملA new synthesis algorithm using phase information for TTS systems
New speech synthesis algorithms capable of flexible prosody (es pecially F0) modification are desired for a high quality TTS syst em. TD-PSOLA is the most popular synthesis algorithm. The al gorithm shows very high quality when F0 modification is limite d. However, the quality degradation due to pitch epoch detection error becomes severe as the F0 modification factor becomes lar ge. On the othe...
متن کاملDesigning Japanese Speech Database Cov for Hybrid Speech Sy
For the purpose of building Text-to-Speech (TTS) system that can generate high-quality and wide range speech in prosody, we conducted speech database construction. As a speech synthesizer, we use a hybrid system which consists of a unit selection module and prosody modification by STRAIGHT (vocoder type high quality analysis-synthesis method). Our viewpoint is to reduce an amount of prosody mod...
متن کامل